Building an organic block storage service at CERN with Ceph
نویسندگان
چکیده
Emerging storage requirements, such as the need for block storage for both OpenStack VMs and file services like AFS and NFS, have motivated the development of a generic backend storage service for CERN IT. The goals for such a service include (a) vendor neutrality, (b) horizontal scalability with commodity hardware, (c) fault tolerance at the disk, host, and network levels, and (d) support for geo-replication. Ceph is an attractive option due to its native block device layer RBD which is built upon its scalable, reliable, and performant object storage system, RADOS. It can be considered an “organic” storage solution because of its ability to balance and heal itself while living on an ever-changing set of heterogeneous disk servers. This work will present the outcome of a petabyte-scale test deployment of Ceph by CERN IT. We will first present the architecture and configuration of our cluster, including a summary of best practices learned from the community and discovered internally. Next the results of various functionality and performance tests will be shown: the cluster has been used as a backend block storage system for AFS and NFS servers as well as a large OpenStack cluster at CERN. Finally, we will discuss the next steps and future possibilities for Ceph at CERN.
منابع مشابه
The Design and Implementation of AQuA: An Adaptive Quality of Service Aware Object-Based Storage Device
As storage systems are becoming larger and more complicated, they must handle an increasing variety of data storage needs. Storage system Quality of Service (QoS) has therefore become an important issue. Storage systems must be able to deliver satisfactory performance under both expected and unexpected workloads and, increasingly, must handle data (such as multimedia data) with real timeliness ...
متن کاملUnderstanding the Performance of Ceph Block Storage for Hyper-Converged Cloud with All Flash Storage
Hyper-converged cloud refers to an architecture that an operator runs compute and storage services on the same set of physical servers. Although the hyperconverged design comes with a number of benefits, it makes crucial operational tasks, such as capacity planning and cost analysis, fairly complicated. The problem becomes more onerous if we consider a complex distributed system, such as Ceph, ...
متن کاملEnabling Object Storage via shims for Grid Middleware
The Object Store model has quickly become the basis of most commercially successful mass storage infrastructure, backing so-called ”Cloud” storage such as Amazon S3, but also underlying the implementation of most parallel distributed storage systems. Many of the assumptions in Object Store design are similar, but not identical, to concepts in the design of Grid Storage Elements, although the re...
متن کاملBrados: Declarative, Programmable Object Storage
To meet the needs of a diverse and growing set of cloud-based applications, modern distributed storage frameworks expose a variety of composable subsystems as building blocks. This approach gives infrastructure programmers significant flexibility in implementing application-specific semantics while reusing trusted components. Unfortunately, in current storage systems the composition of subsyste...
متن کاملGeneralStore: Declarative Programmable Storage
Despite the broad and growing diversity of storage-intensive applications, storage interfaces have evolved little over time. Fears of vendor lock-in and loss or inaccessibility of legacy data have discouraged the evolution of the POSIX API to better support existing and emerging workloads. This has long been a source of consternation for applications developers, who are often required to duplic...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014